Long-range Correlation and Partial 1=f Spectrum in a Non-coding DNA Sequence
نویسندگان
چکیده
Mutual information function, which is an alternative to correlation function for symbolic sequences, and a \symbolic spectrum" are calculated for a human DNA sequence containing mostly intron segments, those that do not code for proteins. It is observed that the mutual information function of this sequence decays very slowly, and the correlation length is extremely long (at least 800 bases). The symbolic spectrum of the sequence at very low frequencies can be approximated by 1=f , where f is the frequency and ranges from 0.5 to 0.85. It is suggested that the existence of the repetitive patterns in the sequence is mainly responsible for the observed long-range correlation. A possible connection between this long-range correlation and those in music notes is also brie y discussed. PACS. 87.10 | General, theoretical, and mathematical biophysics. PACS. 02.50 | Probability theory, stochastic processes, and statistics. Address after September 1, 1991: Physics Department, Rockefeller University, 1230 York Avenue, New York, NY 10021
منابع مشابه
Long - Range Correlation and Partial llf * Spectrum in a Noncoding DNA Sequence
Mutual information function, which is an alternative to correlation function for symbolic sequences, and a esymbolic spectrum, are calculated for a human DNA sequence containing mostly intron segments, those that do not code for proteins. It is observed that the mutual information function of this sequence decays very slowly, and the correlation length is extremely long (at least 800 bases). Th...
متن کاملInvestigation of Polymorphisms in Non-Coding Region of Human Mitochondrial DNA in 31 Iranian Hypertrophic Cardiomyopathy (HCM) Patients
The D-loop region is a hot spot for mitochondrial DNA (mtDNA) alterations, containing two hypervariable segments, HVS-I and HVS-II. In order to identify polymorphic sites and potential genetic background accounting for Hypertrophic CardioMyopathy (HCM) disease, the complete non-coding region of mtDNA from 31 unrelated HCM patients and 45 normal controls were sequenced. The sequences were aligne...
متن کاملThe Role of Long Non Coding RNAs in the Repair of DNA Double Strand Breaks
DNA double strand breaks (DSBs) are abrasions caused in both strands of the DNA duplex following exposure to both exogenous and endogenous conditions. Such abrasions have deleterious effect in cells leading to genome rearrangements and cell death. A number of repair systems including homologous recombination (HR) and non-homologous end-joining (NHEJ) have been evolved to minimize the fatal effe...
متن کاملPhylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467
Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...
متن کاملStatistical properties of DNA sequences.
We review evidence supporting the idea that the DNA sequence in genes containing non-coding regions is correlated, and that the correlation is remarkably long range--indeed, nucleotides thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationarity" feature of the sequence of base pa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1992